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METHOD AND SYSTEM FOR PERFORMING SHIFT OPERATIONS 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The invention relates generally to computer 
architectures and, more particularly, to a method and an 
apparatus for shifting an operand a specified direction and 
amount . 

Description of Related Art 

Computer processors are constantly being designed with 
additional capabilities. In particular, some processors, 
such as the PowerPC designed by IBM Corp., Apple Computers 
Corp., and Motorola, Inc., are being modified to provide 
multimedia extensions, such as the Vector Multimedia 
Extension (VMX) . Some extensions such as a shift 

instruction, however, require additional processing and, 
therefore, may not be as efficient as desired. 

Shift instructions frequently utilize multiple operands 
to specify the desired action. A first operand typically 
specifies the operand that is to be shifted left or right, a 
second operand typically specifies the amount that the first 
operand is to be shifted, and a third operand typically 
specifies the location to place the result. Additional, or 
fewer, operands may be present depending on the 
implementation . 

Generally, the second operand that specifies the amount 
must be decoded to extract the necessary information and to 
format the information appropriately. The process of 
decoding the second operand, however, is processing- 
intensive and requires additional space. 
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Therefore, there is a need to provide a method and an 
apparatus for efficiently shifting a value a specified 
amount and direction. 

5 SUMMARY 

The present invention provides a method and an 
apparatus for performing a shift operation on an operand. 
The method and apparatus configures an input line comprising 
a first part that includes the bits in order representing 
10 various shift amounts in a first direction and a second part 
that includes bits ordered representing various shift 
amounts in a second direction. The shift amount is then 
utilized to index into the input line and select the 
appropriate bits to create the result. 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 

For a more complete understanding of the present 
invention, and the advantages thereof, reference is now made 
to the following description taken in conjunction with the 
20 accompanying drawings, in which: 

FIGURE 1 is a schematic diagram of a typical circuit 
that embodies the present invention; 

FIGURE 2 is a schematic diagram of a circuit that 
illustrates one embodiment of the present invention that 
25 shifts an operand a specified direction and amount; and 

FIGURE 3 is a data flow diagram illustrating one 
embodiment of the present invention in which an operand is 
shifted a specified direction and amount. 

30 DETAILED DESCRIPTION 

In the following discussion, numerous specific details 
are set forth to provide a thorough understanding of the 
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present invention. However, it will be obvious to those 
skilled in the art that the present invention may be 
practiced without such specific details. In other 

instances, well-known elements have been illustrated in 
5 schematic or block diagram form in order not to obscure the 
present invention in unnecessary detail. Additionally, for 
the most part, details concerning operation and the 
connectivity of the individual components of the present 
invention, and the like, have been omitted inasmuch as such 
10 details are not considered necessary to obtain a complete 
understanding of the present invention, and are considered 
to be within the skills of persons of ordinary skill in the 
relevant art, 

It is further noted that, unless indicated otherwise, 

15 all functions described herein may be performed in either 
hardware or software, or some combination thereof. In a 
preferred embodiment, however, the functions are implemented 
in hardware in order to provide the most efficient 
implementation. Alternatively, the functions may be 

2 0 performed by a processor such as a computer or an electronic 
data processor in accordance with code such as computer 
program code, software, and/or integrated circuits that are 
coded to perform such functions, unless indicated otherwise. 
Referring to FIGURE 1 of the drawings, the reference 

25 numeral 100 generally designates a circuit, which may be 
particularly useful in performing a vector shift 
instruction, such as a Vector Shift Left Octet (vslo) , a 
Vector Shift Right Octet (vsro) , and/or the like, of the 
Vector Multimedia Extension (VMX) standard for the PowerPC 

30 designed and developed by Apple Computers, Inc., IBM, Corp., 
and Motorola, Inc., embodying features of the present 
invention. The circuit 100 is exemplified herein as 
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coexisting with another circuit, in this case a crossbar, 
commonly referred to as an XBAR function, that is capable of 
implementing many vector permute operations, such as vector 
packs, unpacks, merges, splats, and vector shift left double 
5 instructions, or the like, to demonstrate that additional 
efficiencies may be achieved by implementing the circuit 100 
in conjunction with another compatible circuit. The circuit 
100 may, however, be implemented with other types of 
circuits or individually. Furthermore, the discussion that 

10 follows and FIGURES 2 and 3 illustrate the present invention 
in terms of a 128-bit shift instruction for purposes of 
illustration only and should not be construed so as to limit 
the present invention in any manner. For example, the 
circuit 100 could be used with other types of circuits, 

15 individually, other bit lengths such as 64, 256, or the 
like, and the like, and is considered to be obvious to a 
person of ordinary skill in the art upon a reading of the 
present disclosure . 

The circuit 100 generally comprises a decoder 110 

20 configured for receiving and decoding an instruction 112. 
Generally, an instruction has the following format: 
operation VT,VA,VB 
where : 

operation is the requested action, such as vslo, 
25 vsro f or the like; 

VT specifies the destination of the output; 
VA specifies the operand to be shifted; and 
VB specifies the operand that contains the shift 
amount . 

30 The VA and VB operands may typically be any allowable 

data source (not shown), such as register files, bypasses, 
data forwarding, load ports, and/or the like, that is 
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selectable via a mux (not shown) . In FIG. 1, VA net 114 and 
VB net 12 6 represent the VA and VB operands, respectively, 
that are preferably sourced via mux from one or more data 
sources . 

5 Preferably, one or more bits of the operand specifying 

the shift amount, e.g., bits 121-124 of VB net 126 of a vslo 
and a vsro instruction, are coupled to a mux 128. The 
select line of the mux 128 is controlled by the decoder 110, 
which selects the shift amount bits and shift direction when 
10 the decoder determines that the instruction 112 is a vslo or 
vsro instruction. The mux 128 is configured to latch the 
shift amount from the VB net 12 6, and an indication of the 
shift direction from the decoder, referred to as the VC 
operand 130, to the shifter 122 and the crossbar 124. In 
15 the preferred embodiment, the VC operand 130 is configured 
as 16 byte slices (16 byte slices times 8 bits/byte is 128 
total bits) . The shift direction is preferably specified in 
bit 3, which is from the decoder 110, and the shift amount, 
which is from bits of the VB net 12 6, is preferably 
20 specified in bits 4-7 of each byte slice, bits 0-2 being 
unused. The shift direction is preferably set to a "0" to 
indicate a left shift and to a "1" to indicate a right 
shift. As will be appreciated by one skilled in the art, 
each byte slice contains the same value and will be in the 
25 range of 0-31, with 0-15 indicating a left shift of 0-15 
bytes and 16-31 indicating a right shift of 0-15 bytes. 

VB net 12 6 is also coupled to a mux 132, whose select 
line is controlled by the decoder 110. The mux 132 is 
configured to allow the decoder 110 to select between the 
30 sourced operand, i.e., VB net 126, or a constant zero value. 
In the event that the decoder 110 determines that the 
instruction 112 is a shift instruction, such as vslo, vsro, 



5 



Docket No. AUS9200107 99US1 PATENT APPLICATION 

or the like, the decoder 110 sets the select line of the mux 
132 such that the zero constant value is latched as a VB 
operand 134 to the shifter 122 and crossbar 124. 

It should be noted that the circuit 100 assumes that 
5 the shift instructions shifts the VA operand 120 left or 
right and fills the vacated positions with zeroes. 
Implementing other types of shifts, such as a circular 
shift, a signed shift, and/or the like, may be designed 
similarly and is considered obvious to a person of ordinary 
10 skill in the art upon a reading of the present disclosure. 

The shifter 122 and the crossbar 124 is configured to 
provide a shifter output 136 and an crossbar output 138, 
respectively, preferably to a mux 140, which is used to 
select the output of one or more circuits, based on the 
15 instruction 112. Accordingly, the shifter output 136 is 
chosen for vslo and vsro instructions, and the crossbar 
output 138 or any other output done in parallel with the 
shifter and crossbar are chosen for all other instructions. 
Alternatively, data forwarding (not shown) may be enabled in 
20 order to obtain greater efficiencies. 

FIGURE 2 is a schematic diagram depicting a circuit 
that may be designed to perform operations of the shifter 
122 (FIG.l) in accordance with one embodiment of the present 
invention that receives a 128-bit VA operand 120 (FIG. 1) to 
25 shift, a 128-bit VB (zero filled) operand 134 (FIG. 1), and 
a VC operand 130 (FIG. 1) that specifies the shift direction 
and the shift amount as discussed above. Accordingly, 
VA(xi-x 2 ) represents a byte corresponding to the xi fch bit to 
the x 2 th bit of the VA operand 12 0, VB(yi-y 2 ) represents a 
30 byte corresponding to the yi th bit to the y 2 th bit of the VB 
operand 134, and VC(zi-z 2 ) represents the z x th bit to the z 2 th 
bit of the VC operand 130. As noted above, the illustrated 
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shifter is a byte shifter. Shifters of other amounts, such 
as words or bits, may be designed and are considered to be 
within the skills of a person of ordinary skill in the art 
upon a reading of the present disclosure. 
5 As will be appreciated by one skilled in the art, the 

bits contained in each byte slice of the VC operand 130 that 
specify the shift direction and the shift amount, are used 
as a single 5-bit value ranging from 0-31. The 5-bit value 
is used as the select line of muxes 210 to select the 
10 corresponding byte of the designated input lines, forming 
the dout lines. The connection of the bits of the VA 
operand 120 and the VB operand 134 as shown shifts the VA 
operand to the left or right from 0 to 15 bytes, as 
specified by the 5-bit value. 
15 For example, a vslo shift instruction with a shift 

value of 1 results in a VC operand 130 with each byte slice 
containing a "xxxOOOOl" (binary), and, therefore, each 
select line of the muxes 210 are set to "1" (hex) . As a 
result, the byte corresponding to the second mux data line 
20 (the first mux data line representing a shift of zero) is 
selected: dout (0-7) =VA( 8-15) , dout (8-15) -VA (16-23) , . . 
dout (120-127)=VB(0-7) . This equates to a shift left by one 
byte, bringing in one byte from VB, which contains all 
zeros . 

25 For another example, a vsro shift instruction with a 

shift value of 15 results in a VC operand 130 with each byte 
slice containing "xxxlllll" (binary), and, therefore, each 
select line of the muxes 210 are set to "IF" (hex) . As a 
result, the byte corresponding to the thirty-second mux data 

30 line, i.e., the last mux data line, is selected: dout(0- 
7)=VB(8-15), dout (8-15)=VB(16-23) , . . dout (120- 

127)=VA(0-7) . This equates to a shift right by 15 bytes, 
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filling vacated bytes of the VA operand with zeroes from the 
VB operand. 

FIGURE 3 is a flow chart depicting steps that may be 
performed by the circuit 100 in accordance with one 
5 embodiment of the present invention that shifts an operand 
to the left or right a specified number of bytes. 
Processing begins in step 310, wherein a shift instruction, 
such as vslo and/or vsro, is received and decoded. The 
shift instruction is decoded to determine, among other 
10 things, the type of instruction and hence the shift 
direction for vslo and vsro instructions. Upon receiving 
the instruction, in steps 32 0 and 322, the VA operand and 
the VB operand is retrieved, respectively, from the 
specified source, such as a register file, bypass, load 
15 port, and/or the like, as is described above. 

In step 324, the VC operand is constructed. 
Preferably, as discussed above with reference to FIG. 1, the 
VC operand 130 comprises 16 bytes, each byte containing the 
shift direction in bit 3 and the shift amount in bits 4-7. 
20 Also after step 322, in step 326, the VB operand 134 is 
filled with zeroes for the vslo and vsro instructions. 

After steps 320, 324 and 326, processing proceeds to 
step 328, wherein the VA operand 120 is shifted in the 
amount and direction specified by the VC operand 130, 
25 filling the vacated bytes with the value of the VB operand 
134. 

It is understood that the present invention can take 
many forms and embodiments. Accordingly, several variations 
may be made in the foregoing without departing from the 
30 spirit or the scope of the invention. For example, 
different data widths, such as 8, 16, 32, 64, 256, 512, and 
the like may be used, differing bit positions may be used 
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for the VC mux selects , differing shift amounts, and/or 
different sources for the VC mux selects may be used. 

Having thus described the present invention by- 
reference to certain of its preferred embodiments, it is 
noted that the embodiments disclosed are illustrative rather 
than limiting in nature and that a wide range of variations, 
modifications, changes, and substitutions are contemplated 
in the foregoing disclosure and, in some instances, some 
features of the present invention may be employed without a 
corresponding use of the other features. Many such 
variations and modifications may be considered obvious and 
desirable by those skilled in the art based upon a review of 
the foregoing description of preferred embodiments. 
Accordingly, it is appropriate that the appended claims be 
construed broadly and in a manner consistent with the scope 
of the invention. 



